Dataset statistics
| Number of variables | 35 |
|---|---|
| Number of observations | 66940 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 16.5 MiB |
| Average record size in memory | 259.0 B |
Variable types
| Categorical | 23 |
|---|---|
| Numeric | 9 |
| Boolean | 3 |
사고번호 has a high cardinality: 66940 distinct values | High cardinality |
시군구_소범주 has a high cardinality: 465 distinct values | High cardinality |
사고요일 is highly correlated with 주말여부 | High correlation |
중상자수 is highly correlated with 경상자수 and 1 other fields | High correlation |
경상자수 is highly correlated with 중상자수 and 1 other fields | High correlation |
부상신고자수 is highly correlated with 대형사고여부 | High correlation |
사고내용 is highly correlated with 사망자수 and 2 other fields | High correlation |
사망자수 is highly correlated with 사고내용 and 1 other fields | High correlation |
사고유형_대범주 is highly correlated with 사고유형_소범주 and 1 other fields | High correlation |
사고유형_소범주 is highly correlated with 사고유형_대범주 and 2 other fields | High correlation |
도로형태_대범주 is highly correlated with 도로형태_소범주 | High correlation |
도로형태_소범주 is highly correlated with 도로형태_대범주 and 1 other fields | High correlation |
노면상태_대범주 is highly correlated with 노면상태_소범주 | High correlation |
노면상태_소범주 is highly correlated with 노면상태_대범주 and 1 other fields | High correlation |
피해운전자차종 is highly correlated with 사고유형_대범주 and 1 other fields | High correlation |
피해운전자상해정도 is highly correlated with 사고내용 and 1 other fields | High correlation |
주말여부 is highly correlated with 사고요일 | High correlation |
대형사고여부 is highly correlated with 중상자수 and 2 other fields | High correlation |
법규위반 is highly correlated with 사고유형_소범주 and 1 other fields | High correlation |
기상상태 is highly correlated with 노면상태_소범주 | High correlation |
가해운전자상해정도 is highly correlated with 사고내용 | High correlation |
사고번호 is uniformly distributed | Uniform |
사고번호 has unique values | Unique |
사고시각 has 2018 (3.0%) zeros | Zeros |
사고요일 has 9625 (14.4%) zeros | Zeros |
중상자수 has 50625 (75.6%) zeros | Zeros |
경상자수 has 17900 (26.7%) zeros | Zeros |
부상신고자수 has 60363 (90.2%) zeros | Zeros |
Reproduction
| Analysis started | 2022-11-19 02:20:45.485447 |
|---|---|
| Analysis finished | 2022-11-19 02:21:32.691703 |
| Duration | 47.21 seconds |
| Software version | pandas-profiling v3.4.0 |
| Download configuration | config.json |
| Distinct | 66940 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 523.1 KiB |
| A2019010100100001 | 1 |
|---|---|
| A2020042300100046 | 1 |
| A2020042300100063 | 1 |
| A2020042300100064 | 1 |
| A2020042300100065 | 1 |
| Other values (66935) |
Length
| Max length | 17 |
|---|---|
| Median length | 17 |
| Mean length | 17 |
| Min length | 17 |
Characters and Unicode
| Total characters | 1137980 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 66940 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | A2019010100100001 |
|---|---|
| 2nd row | A2019010100100002 |
| 3rd row | A2019010100100003 |
| 4th row | A2019010100100019 |
| 5th row | A2019010100100020 |
Common Values
| Value | Count | Frequency (%) |
| A2019010100100001 | 1 | < 0.1% |
| A2020042300100046 | 1 | < 0.1% |
| A2020042300100063 | 1 | < 0.1% |
| A2020042300100064 | 1 | < 0.1% |
| A2020042300100065 | 1 | < 0.1% |
| A2020042300100066 | 1 | < 0.1% |
| A2020042300100067 | 1 | < 0.1% |
| A2020042300100068 | 1 | < 0.1% |
| A2020042300100069 | 1 | < 0.1% |
| A2020042300100070 | 1 | < 0.1% |
| Other values (66930) | 66930 |
Length
| Value | Count | Frequency (%) |
| a2019010100100001 | 1 | < 0.1% |
| a2019010100100234 | 1 | < 0.1% |
| a2019010100100233 | 1 | < 0.1% |
| a2019010100100003 | 1 | < 0.1% |
| a2019010100100019 | 1 | < 0.1% |
| a2019010100100020 | 1 | < 0.1% |
| a2019010100100021 | 1 | < 0.1% |
| a2019010100100022 | 1 | < 0.1% |
| a2019010100100023 | 1 | < 0.1% |
| a2019010100100041 | 1 | < 0.1% |
| Other values (66930) | 66930 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 473167 | |
| 1 | 185194 | 16.3% |
| 2 | 161834 | 14.2% |
| A | 66940 | 5.9% |
| 9 | 60386 | 5.3% |
| 3 | 39666 | 3.5% |
| 4 | 35071 | 3.1% |
| 5 | 33632 | 3.0% |
| 6 | 29924 | 2.6% |
| 7 | 26750 | 2.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1071040 | |
| Uppercase Letter | 66940 | 5.9% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 473167 | |
| 1 | 185194 | 17.3% |
| 2 | 161834 | 15.1% |
| 9 | 60386 | 5.6% |
| 3 | 39666 | 3.7% |
| 4 | 35071 | 3.3% |
| 5 | 33632 | 3.1% |
| 6 | 29924 | 2.8% |
| 7 | 26750 | 2.5% |
| 8 | 25416 | 2.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 66940 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1071040 | |
| Latin | 66940 | 5.9% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 473167 | |
| 1 | 185194 | 17.3% |
| 2 | 161834 | 15.1% |
| 9 | 60386 | 5.6% |
| 3 | 39666 | 3.7% |
| 4 | 35071 | 3.3% |
| 5 | 33632 | 3.1% |
| 6 | 29924 | 2.8% |
| 7 | 26750 | 2.5% |
| 8 | 25416 | 2.4% |
Latin
| Value | Count | Frequency (%) |
| A | 66940 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1137980 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 473167 | |
| 1 | 185194 | 16.3% |
| 2 | 161834 | 14.2% |
| A | 66940 | 5.9% |
| 9 | 60386 | 5.3% |
| 3 | 39666 | 3.5% |
| 4 | 35071 | 3.1% |
| 5 | 33632 | 3.0% |
| 6 | 29924 | 2.6% |
| 7 | 26750 | 2.4% |
사고년도
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 523.1 KiB |
| 2019 | |
|---|---|
| 2020 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 267760 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2019 |
|---|---|
| 2nd row | 2019 |
| 3rd row | 2019 |
| 4th row | 2019 |
| 5th row | 2019 |
Common Values
| Value | Count | Frequency (%) |
| 2019 | 35399 | |
| 2020 | 31541 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 2019 | 35399 | |
| 2020 | 31541 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 98481 | |
| 0 | 98481 | |
| 1 | 35399 | 13.2% |
| 9 | 35399 | 13.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 267760 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 98481 | |
| 0 | 98481 | |
| 1 | 35399 | 13.2% |
| 9 | 35399 | 13.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 267760 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 98481 | |
| 0 | 98481 | |
| 1 | 35399 | 13.2% |
| 9 | 35399 | 13.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 267760 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 98481 | |
| 0 | 98481 | |
| 1 | 35399 | 13.2% |
| 9 | 35399 | 13.2% |
사고월
Real number (ℝ≥0)
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.623946818 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 523.1 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 4 |
| median | 7 |
| Q3 | 10 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.395981179 |
|---|---|
| Coefficient of variation (CV) | 0.5126824342 |
| Kurtosis | -1.176971574 |
| Mean | 6.623946818 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -0.05545335964 |
| Sum | 443407 |
| Variance | 11.53268817 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10 | 6068 | |
| 11 | 6004 | |
| 5 | 5955 | |
| 6 | 5846 | |
| 7 | 5822 | |
| 8 | 5701 | |
| 9 | 5613 | |
| 4 | 5439 | |
| 1 | 5237 | |
| 12 | 5235 | |
| Other values (2) | 10020 |
| Value | Count | Frequency (%) |
| 1 | 5237 | |
| 2 | 4920 | |
| 3 | 5100 | |
| 4 | 5439 | |
| 5 | 5955 | |
| 6 | 5846 | |
| 7 | 5822 | |
| 8 | 5701 | |
| 9 | 5613 | |
| 10 | 6068 |
| Value | Count | Frequency (%) |
| 12 | 5235 | |
| 11 | 6004 | |
| 10 | 6068 | |
| 9 | 5613 | |
| 8 | 5701 | |
| 7 | 5822 | |
| 6 | 5846 | |
| 5 | 5955 | |
| 4 | 5439 | |
| 3 | 5100 |
사고일
Real number (ℝ≥0)
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.90470571 |
| Minimum | 1 |
|---|---|
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 523.1 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 16 |
| Q3 | 23 |
| 95-th percentile | 30 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 8.707485334 |
|---|---|
| Coefficient of variation (CV) | 0.5474785572 |
| Kurtosis | -1.170247768 |
| Mean | 15.90470571 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | -0.008839538678 |
| Sum | 1064661 |
| Variance | 75.82030084 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 18 | 2341 | 3.5% |
| 23 | 2335 | 3.5% |
| 19 | 2310 | 3.5% |
| 24 | 2291 | 3.4% |
| 8 | 2290 | 3.4% |
| 11 | 2272 | 3.4% |
| 17 | 2270 | 3.4% |
| 25 | 2263 | 3.4% |
| 15 | 2261 | 3.4% |
| 14 | 2256 | 3.4% |
| Other values (21) | 44051 |
| Value | Count | Frequency (%) |
| 1 | 1898 | |
| 2 | 2095 | |
| 3 | 2056 | |
| 4 | 2168 | |
| 5 | 2123 | |
| 6 | 2058 | |
| 7 | 2245 | |
| 8 | 2290 | |
| 9 | 2174 | |
| 10 | 2219 |
| Value | Count | Frequency (%) |
| 31 | 1354 | |
| 30 | 2043 | |
| 29 | 2127 | |
| 28 | 2170 | |
| 27 | 2052 | |
| 26 | 2047 | |
| 25 | 2263 | |
| 24 | 2291 | |
| 23 | 2335 | |
| 22 | 2237 |
| Distinct | 24 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5634367842 |
| Minimum | 0 |
|---|---|
| Maximum | 0.9583333333 |
| Zeros | 2018 |
| Zeros (%) | 3.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 523.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.04166666667 |
| Q1 | 0.375 |
| median | 0.5833333333 |
| Q3 | 0.75 |
| 95-th percentile | 0.9166666667 |
| Maximum | 0.9583333333 |
| Range | 0.9583333333 |
| Interquartile range (IQR) | 0.375 |
Descriptive statistics
| Standard deviation | 0.253978583 |
|---|---|
| Coefficient of variation (CV) | 0.4507667766 |
| Kurtosis | -0.5857274026 |
| Mean | 0.5634367842 |
| Median Absolute Deviation (MAD) | 0.1666666667 |
| Skewness | -0.4720509568 |
| Sum | 37716.45833 |
| Variance | 0.06450512063 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.75 | 4682 | 7.0% |
| 0.7083333333 | 4170 | 6.2% |
| 0.7916666667 | 4080 | 6.1% |
| 0.6666666667 | 3991 | 6.0% |
| 0.625 | 3805 | 5.7% |
| 0.5416666667 | 3596 | 5.4% |
| 0.5833333333 | 3587 | 5.4% |
| 0.5 | 3446 | 5.1% |
| 0.8333333333 | 3357 | 5.0% |
| 0.4583333333 | 3278 | 4.9% |
| Other values (14) | 28948 |
| Value | Count | Frequency (%) |
| 0 | 2018 | |
| 0.04166666667 | 1636 | |
| 0.08333333333 | 1169 | 1.7% |
| 0.125 | 900 | 1.3% |
| 0.1666666667 | 886 | 1.3% |
| 0.2083333333 | 1179 | 1.8% |
| 0.25 | 1374 | |
| 0.2916666667 | 1891 | |
| 0.3333333333 | 3179 | |
| 0.375 | 3089 |
| Value | Count | Frequency (%) |
| 0.9583333333 | 2530 | |
| 0.9166666667 | 2843 | |
| 0.875 | 3269 | |
| 0.8333333333 | 3357 | |
| 0.7916666667 | 4080 | |
| 0.75 | 4682 | |
| 0.7083333333 | 4170 | |
| 0.6666666667 | 3991 | |
| 0.625 | 3805 | |
| 0.5833333333 | 3587 |
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.888362713 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 9625 |
| Zeros (%) | 14.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 523.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.927744885 |
|---|---|
| Coefficient of variation (CV) | 0.6674178682 |
| Kurtosis | -1.185360148 |
| Mean | 2.888362713 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.02694390661 |
| Sum | 193347 |
| Variance | 3.71620034 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 10813 | |
| 3 | 10033 | |
| 1 | 10032 | |
| 2 | 9782 | |
| 0 | 9625 | |
| 5 | 9530 | |
| 6 | 7125 |
| Value | Count | Frequency (%) |
| 0 | 9625 | |
| 1 | 10032 | |
| 2 | 9782 | |
| 3 | 10033 | |
| 4 | 10813 | |
| 5 | 9530 | |
| 6 | 7125 |
| Value | Count | Frequency (%) |
| 6 | 7125 | |
| 5 | 9530 | |
| 4 | 10813 | |
| 3 | 10033 | |
| 2 | 9782 | |
| 1 | 10032 | |
| 0 | 9625 |
시군구_대범주
Categorical
| Distinct | 26 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 523.1 KiB |
| 강남구 | |
|---|---|
| 송파구 | |
| 영등포구 | 4289 |
| 서초구 | 4210 |
| 강서구 | 2995 |
| Other values (21) |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 3.104377054 |
| Min length | 2 |
Characters and Unicode
| Total characters | 207807 |
|---|---|
| Distinct characters | 39 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 강서구 |
|---|---|
| 2nd row | 구로구 |
| 3rd row | 서초구 |
| 4th row | 중구 |
| 5th row | 성동구 |
Common Values
| Value | Count | Frequency (%) |
| 강남구 | 6633 | 9.9% |
| 송파구 | 5029 | 7.5% |
| 영등포구 | 4289 | 6.4% |
| 서초구 | 4210 | 6.3% |
| 강서구 | 2995 | 4.5% |
| 노원구 | 2922 | 4.4% |
| 동대문구 | 2786 | 4.2% |
| 중랑구 | 2705 | 4.0% |
| 성북구 | 2526 | 3.8% |
| 구로구 | 2525 | 3.8% |
| Other values (16) | 30320 |
Length
| Value | Count | Frequency (%) |
| 강남구 | 6633 | 9.9% |
| 송파구 | 5029 | 7.5% |
| 영등포구 | 4289 | 6.4% |
| 서초구 | 4210 | 6.3% |
| 강서구 | 2995 | 4.5% |
| 노원구 | 2922 | 4.4% |
| 동대문구 | 2786 | 4.2% |
| 중랑구 | 2705 | 4.0% |
| 성북구 | 2526 | 3.8% |
| 구로구 | 2525 | 3.8% |
| Other values (16) | 30320 |
Most occurring characters
| Value | Count | Frequency (%) |
| 구 | 69462 | |
| 강 | 14238 | 6.9% |
| 동 | 9643 | 4.6% |
| 서 | 9061 | 4.4% |
| 포 | 6696 | 3.2% |
| 남 | 6633 | 3.2% |
| 송 | 5029 | 2.4% |
| 파 | 5029 | 2.4% |
| 북 | 4688 | 2.3% |
| 중 | 4649 | 2.2% |
| Other values (29) | 72679 |
Most occurring categories
| Value | Count | Frequency (%) |
| Other Letter | 207807 |
Most frequent character per category
Other Letter
| Value | Count | Frequency (%) |
| 구 | 69462 | |
| 강 | 14238 | 6.9% |
| 동 | 9643 | 4.6% |
| 서 | 9061 | 4.4% |
| 포 | 6696 | 3.2% |
| 남 | 6633 | 3.2% |
| 송 | 5029 | 2.4% |
| 파 | 5029 | 2.4% |
| 북 | 4688 | 2.3% |
| 중 | 4649 | 2.2% |
| Other values (29) | 72679 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Hangul | 207807 |
Most frequent character per script
Hangul
| Value | Count | Frequency (%) |
| 구 | 69462 | |
| 강 | 14238 | 6.9% |
| 동 | 9643 | 4.6% |
| 서 | 9061 | 4.4% |
| 포 | 6696 | 3.2% |
| 남 | 6633 | 3.2% |
| 송 | 5029 | 2.4% |
| 파 | 5029 | 2.4% |
| 북 | 4688 | 2.3% |
| 중 | 4649 | 2.2% |
| Other values (29) | 72679 |
Most occurring blocks
| Value | Count | Frequency (%) |
| Hangul | 207807 |
Most frequent character per block
Hangul
| Value | Count | Frequency (%) |
| 구 | 69462 | |
| 강 | 14238 | 6.9% |
| 동 | 9643 | 4.6% |
| 서 | 9061 | 4.4% |
| 포 | 6696 | 3.2% |
| 남 | 6633 | 3.2% |
| 송 | 5029 | 2.4% |
| 파 | 5029 | 2.4% |
| 북 | 4688 | 2.3% |
| 중 | 4649 | 2.2% |
| Other values (29) | 72679 |
| Distinct | 465 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 523.1 KiB |
| 강남구 역삼동 | 1422 |
|---|---|
| 관악구 신림동 | 1306 |
| 서초구 서초동 | 1268 |
| 노원구 상계동 | 1240 |
| 강서구 화곡동 | 1121 |
| Other values (460) |
Length
| Max length | 11 |
|---|---|
| Median length | 7 |
| Mean length | 7.313683896 |
| Min length | 5 |
Characters and Unicode
| Total characters | 489578 |
|---|---|
| Distinct characters | 218 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 강서구 방화동 |
|---|---|
| 2nd row | 구로구 고척동 |
| 3rd row | 서초구 서초동 |
| 4th row | 중구 회현동2가 |
| 5th row | 성동구 행당동 |
Common Values
| Value | Count | Frequency (%) |
| 강남구 역삼동 | 1422 | 2.1% |
| 관악구 신림동 | 1306 | 2.0% |
| 서초구 서초동 | 1268 | 1.9% |
| 노원구 상계동 | 1240 | 1.9% |
| 강서구 화곡동 | 1121 | 1.7% |
| 강남구 논현동 | 1071 | 1.6% |
| 구로구 구로동 | 1047 | 1.6% |
| 관악구 봉천동 | 1016 | 1.5% |
| 중랑구 면목동 | 962 | 1.4% |
| 양천구 목동 | 931 | 1.4% |
| Other values (455) | 55556 |
Length
| Value | Count | Frequency (%) |
| 강남구 | 6633 | 5.0% |
| 송파구 | 5029 | 3.8% |
| 영등포구 | 4289 | 3.2% |
| 서초구 | 4210 | 3.1% |
| 강서구 | 2995 | 2.2% |
| 노원구 | 2922 | 2.2% |
| 동대문구 | 2786 | 2.1% |
| 중랑구 | 2705 | 2.0% |
| 성북구 | 2526 | 1.9% |
| 구로구 | 2525 | 1.9% |
| Other values (478) | 97260 |
Most occurring characters
| Value | Count | Frequency (%) |
| 동 | 75625 | 15.4% |
| 구 | 71188 | 14.5% |
| 66940 | 13.7% | |
| 강 | 14687 | 3.0% |
| 서 | 11026 | 2.3% |
| 포 | 8762 | 1.8% |
| 가 | 7736 | 1.6% |
| 남 | 7665 | 1.6% |
| 대 | 7045 | 1.4% |
| 로 | 6966 | 1.4% |
| Other values (208) | 211938 |
Most occurring categories
| Value | Count | Frequency (%) |
| Other Letter | 416593 | |
| Space Separator | 66940 | 13.7% |
| Decimal Number | 6045 | 1.2% |
Most frequent character per category
Other Letter
| Value | Count | Frequency (%) |
| 동 | 75625 | 18.2% |
| 구 | 71188 | 17.1% |
| 강 | 14687 | 3.5% |
| 서 | 11026 | 2.6% |
| 포 | 8762 | 2.1% |
| 가 | 7736 | 1.9% |
| 남 | 7665 | 1.8% |
| 대 | 7045 | 1.7% |
| 로 | 6966 | 1.7% |
| 성 | 6731 | 1.6% |
| Other values (199) | 199162 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1603 | |
| 2 | 1450 | |
| 3 | 997 | |
| 4 | 777 | |
| 5 | 516 | 8.5% |
| 6 | 449 | 7.4% |
| 7 | 184 | 3.0% |
| 8 | 69 | 1.1% |
Space Separator
| Value | Count | Frequency (%) |
| 66940 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Hangul | 416593 | |
| Common | 72985 | 14.9% |
Most frequent character per script
Hangul
| Value | Count | Frequency (%) |
| 동 | 75625 | 18.2% |
| 구 | 71188 | 17.1% |
| 강 | 14687 | 3.5% |
| 서 | 11026 | 2.6% |
| 포 | 8762 | 2.1% |
| 가 | 7736 | 1.9% |
| 남 | 7665 | 1.8% |
| 대 | 7045 | 1.7% |
| 로 | 6966 | 1.7% |
| 성 | 6731 | 1.6% |
| Other values (199) | 199162 |
Common
| Value | Count | Frequency (%) |
| 66940 | ||
| 1 | 1603 | 2.2% |
| 2 | 1450 | 2.0% |
| 3 | 997 | 1.4% |
| 4 | 777 | 1.1% |
| 5 | 516 | 0.7% |
| 6 | 449 | 0.6% |
| 7 | 184 | 0.3% |
| 8 | 69 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| Hangul | 416593 | |
| ASCII | 72985 | 14.9% |
Most frequent character per block
Hangul
| Value | Count | Frequency (%) |
| 동 | 75625 | 18.2% |
| 구 | 71188 | 17.1% |
| 강 | 14687 | 3.5% |
| 서 | 11026 | 2.6% |
| 포 | 8762 | 2.1% |
| 가 | 7736 | 1.9% |
| 남 | 7665 | 1.8% |
| 대 | 7045 | 1.7% |
| 로 | 6966 | 1.7% |
| 성 | 6731 | 1.6% |
| Other values (199) | 199162 |
ASCII
| Value | Count | Frequency (%) |
| 66940 | ||
| 1 | 1603 | 2.2% |
| 2 | 1450 | 2.0% |
| 3 | 997 | 1.4% |
| 4 | 777 | 1.1% |
| 5 | 516 | 0.7% |
| 6 | 449 | 0.6% |
| 7 | 184 | 0.3% |
| 8 | 69 | 0.1% |
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 523.1 KiB |
| 경상사고 | |
|---|---|
| 중상사고 | |
| 부상신고사고 | 4268 |
| 사망사고 | 403 |
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 4.12751718 |
| Min length | 4 |
Characters and Unicode
| Total characters | 276296 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 경상사고 |
|---|---|
| 2nd row | 경상사고 |
| 3rd row | 경상사고 |
| 4th row | 경상사고 |
| 5th row | 경상사고 |
Common Values
| Value | Count | Frequency (%) |
| 경상사고 | 45986 | |
| 중상사고 | 16283 | 24.3% |
| 부상신고사고 | 4268 | 6.4% |
| 사망사고 | 403 | 0.6% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 경상사고 | 45986 | |
| 중상사고 | 16283 | 24.3% |
| 부상신고사고 | 4268 | 6.4% |
| 사망사고 | 403 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 고 | 71208 | |
| 사 | 67343 | |
| 상 | 66537 | |
| 경 | 45986 | |
| 중 | 16283 | 5.9% |
| 부 | 4268 | 1.5% |
| 신 | 4268 | 1.5% |
| 망 | 403 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Other Letter | 276296 |
Most frequent character per category
Other Letter
| Value | Count | Frequency (%) |
| 고 | 71208 | |
| 사 | 67343 | |
| 상 | 66537 | |
| 경 | 45986 | |
| 중 | 16283 | 5.9% |
| 부 | 4268 | 1.5% |
| 신 | 4268 | 1.5% |
| 망 | 403 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Hangul | 276296 |
Most frequent character per script
Hangul
| Value | Count | Frequency (%) |
| 고 | 71208 | |
| 사 | 67343 | |
| 상 | 66537 | |
| 경 | 45986 | |
| 중 | 16283 | 5.9% |
| 부 | 4268 | 1.5% |
| 신 | 4268 | 1.5% |
| 망 | 403 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| Hangul | 276296 |
Most frequent character per block
Hangul
| Value | Count | Frequency (%) |
| 고 | 71208 | |
| 사 | 67343 | |
| 상 | 66537 | |
| 경 | 45986 | |
| 중 | 16283 | 5.9% |
| 부 | 4268 | 1.5% |
| 신 | 4268 | 1.5% |
| 망 | 403 | 0.1% |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 523.1 KiB |
| 0 | |
|---|---|
| 1 | 402 |
| 2 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 66940 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 66537 | |
| 1 | 402 | 0.6% |
| 2 | 1 | < 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 66537 | |
| 1 | 402 | 0.6% |
| 2 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 66537 | |
| 1 | 402 | 0.6% |
| 2 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 66940 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 66537 | |
| 1 | 402 | 0.6% |
| 2 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 66940 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 66537 | |
| 1 | 402 | 0.6% |
| 2 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 66940 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 66537 | |
| 1 | 402 | 0.6% |
| 2 | 1 | < 0.1% |
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2645802211 |
| Minimum | 0 |
|---|---|
| Maximum | 11 |
| Zeros | 50625 |
| Zeros (%) | 75.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 523.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 11 |
| Range | 11 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.4994958498 |
|---|---|
| Coefficient of variation (CV) | 1.887880537 |
| Kurtosis | 11.90191183 |
| Mean | 0.2645802211 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.27850235 |
| Sum | 17711 |
| Variance | 0.249496104 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 50625 | |
| 1 | 15192 | 22.7% |
| 2 | 936 | 1.4% |
| 3 | 137 | 0.2% |
| 4 | 34 | 0.1% |
| 5 | 8 | < 0.1% |
| 6 | 4 | < 0.1% |
| 8 | 2 | < 0.1% |
| 9 | 1 | < 0.1% |
| 11 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 50625 | |
| 1 | 15192 | 22.7% |
| 2 | 936 | 1.4% |
| 3 | 137 | 0.2% |
| 4 | 34 | 0.1% |
| 5 | 8 | < 0.1% |
| 6 | 4 | < 0.1% |
| 8 | 2 | < 0.1% |
| 9 | 1 | < 0.1% |
| 11 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 11 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 8 | 2 | < 0.1% |
| 6 | 4 | < 0.1% |
| 5 | 8 | < 0.1% |
| 4 | 34 | 0.1% |
| 3 | 137 | 0.2% |
| 2 | 936 | 1.4% |
| 1 | 15192 | 22.7% |
| 0 | 50625 |
| Distinct | 22 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.002688975 |
| Minimum | 0 |
|---|---|
| Maximum | 41 |
| Zeros | 17900 |
| Zeros (%) | 26.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 523.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 3 |
| Maximum | 41 |
| Range | 41 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.9670432048 |
|---|---|
| Coefficient of variation (CV) | 0.9644498231 |
| Kurtosis | 78.52393101 |
| Mean | 1.002688975 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.20858312 |
| Sum | 67120 |
| Variance | 0.9351725599 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 37260 | |
| 0 | 17900 | |
| 2 | 8150 | 12.2% |
| 3 | 2270 | 3.4% |
| 4 | 798 | 1.2% |
| 5 | 301 | 0.4% |
| 6 | 115 | 0.2% |
| 7 | 50 | 0.1% |
| 8 | 34 | 0.1% |
| 9 | 20 | < 0.1% |
| Other values (12) | 42 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 17900 | |
| 1 | 37260 | |
| 2 | 8150 | 12.2% |
| 3 | 2270 | 3.4% |
| 4 | 798 | 1.2% |
| 5 | 301 | 0.4% |
| 6 | 115 | 0.2% |
| 7 | 50 | 0.1% |
| 8 | 34 | 0.1% |
| 9 | 20 | < 0.1% |
| Value | Count | Frequency (%) |
| 41 | 1 | < 0.1% |
| 28 | 1 | < 0.1% |
| 27 | 1 | < 0.1% |
| 19 | 1 | < 0.1% |
| 17 | 1 | < 0.1% |
| 16 | 2 | < 0.1% |
| 15 | 1 | < 0.1% |
| 14 | 2 | < 0.1% |
| 13 | 6 | |
| 12 | 4 |
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1113235733 |
| Minimum | 0 |
|---|---|
| Maximum | 17 |
| Zeros | 60363 |
| Zeros (%) | 90.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 523.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 17 |
| Range | 17 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.382203501 |
|---|---|
| Coefficient of variation (CV) | 3.433266553 |
| Kurtosis | 161.0793703 |
| Mean | 0.1113235733 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.326259744 |
| Sum | 7452 |
| Variance | 0.1460795161 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 60363 | |
| 1 | 5973 | 8.9% |
| 2 | 458 | 0.7% |
| 3 | 98 | 0.1% |
| 4 | 27 | < 0.1% |
| 5 | 7 | < 0.1% |
| 6 | 4 | < 0.1% |
| 7 | 3 | < 0.1% |
| 8 | 2 | < 0.1% |
| 9 | 1 | < 0.1% |
| Other values (4) | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 60363 | |
| 1 | 5973 | 8.9% |
| 2 | 458 | 0.7% |
| 3 | 98 | 0.1% |
| 4 | 27 | < 0.1% |
| 5 | 7 | < 0.1% |
| 6 | 4 | < 0.1% |
| 7 | 3 | < 0.1% |
| 8 | 2 | < 0.1% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 17 | 1 | < 0.1% |
| 16 | 1 | < 0.1% |
| 13 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 8 | 2 | < 0.1% |
| 7 | 3 | < 0.1% |
| 6 | 4 | < 0.1% |
| 5 | 7 | < 0.1% |
| 4 | 27 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 523.1 KiB |
| 차대차 | |
|---|---|
| 차대사람 |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 3.238840753 |
| Min length | 3 |
Characters and Unicode
| Total characters | 216808 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 차대사람 |
|---|---|
| 2nd row | 차대차 |
| 3rd row | 차대차 |
| 4th row | 차대차 |
| 5th row | 차대사람 |
Common Values
| Value | Count | Frequency (%) |
| 차대차 | 50952 | |
| 차대사람 | 15988 | 23.9% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 차대차 | 50952 | |
| 차대사람 | 15988 | 23.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 차 | 117892 | |
| 대 | 66940 | |
| 사 | 15988 | 7.4% |
| 람 | 15988 | 7.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Other Letter | 216808 |
Most frequent character per category
Other Letter
| Value | Count | Frequency (%) |
| 차 | 117892 | |
| 대 | 66940 | |
| 사 | 15988 | 7.4% |
| 람 | 15988 | 7.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Hangul | 216808 |
Most frequent character per script
Hangul
| Value | Count | Frequency (%) |
| 차 | 117892 | |
| 대 | 66940 | |
| 사 | 15988 | 7.4% |
| 람 | 15988 | 7.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| Hangul | 216808 |
Most frequent character per block
Hangul
| Value | Count | Frequency (%) |
| 차 | 117892 | |
| 대 | 66940 | |
| 사 | 15988 | 7.4% |
| 람 | 15988 | 7.4% |
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 523.1 KiB |
| 차대차 - 측면충돌 | |
|---|---|
| 차대차 - 기타 | |
| 차대차 - 추돌 | |
| 차대사람 - 기타 | |
| 차대사람 - 횡단중 | |
| Other values (5) |
Length
| Max length | 17 |
|---|---|
| Median length | 12 |
| Mean length | 9.3452196 |
| Min length | 8 |
Characters and Unicode
| Total characters | 625569 |
|---|---|
| Distinct characters | 30 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 차대사람 - 횡단중 |
|---|---|
| 2nd row | 차대차 - 추돌 |
| 3rd row | 차대차 - 기타 |
| 4th row | 차대차 - 측면충돌 |
| 5th row | 차대사람 - 횡단중 |
Common Values
| Value | Count | Frequency (%) |
| 차대차 - 측면충돌 | 23320 | |
| 차대차 - 기타 | 14445 | |
| 차대차 - 추돌 | 10468 | |
| 차대사람 - 기타 | 6674 | 10.0% |
| 차대사람 - 횡단중 | 5728 | 8.6% |
| 차대차 - 정면충돌 | 1817 | 2.7% |
| 차대사람 - 차도통행중 | 1602 | 2.4% |
| 차대사람 - 보도통행중 | 1065 | 1.6% |
| 차대사람 - 길가장자리구역통행중 | 919 | 1.4% |
| 차대차 - 후진중충돌 | 902 | 1.3% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 66940 | ||
| 차대차 | 50952 | |
| 측면충돌 | 23320 | 11.6% |
| 기타 | 21119 | 10.5% |
| 차대사람 | 15988 | 8.0% |
| 추돌 | 10468 | 5.2% |
| 횡단중 | 5728 | 2.9% |
| 정면충돌 | 1817 | 0.9% |
| 차도통행중 | 1602 | 0.8% |
| 보도통행중 | 1065 | 0.5% |
| Other values (2) | 1821 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 133880 | ||
| 차 | 119494 | |
| 대 | 66940 | |
| - | 66940 | |
| 돌 | 36507 | 5.8% |
| 충 | 26039 | 4.2% |
| 면 | 25137 | 4.0% |
| 측 | 23320 | 3.7% |
| 기 | 21119 | 3.4% |
| 타 | 21119 | 3.4% |
| Other values (20) | 85074 |
Most occurring categories
| Value | Count | Frequency (%) |
| Other Letter | 424749 | |
| Space Separator | 133880 | 21.4% |
| Dash Punctuation | 66940 | 10.7% |
Most frequent character per category
Other Letter
| Value | Count | Frequency (%) |
| 차 | 119494 | |
| 대 | 66940 | |
| 돌 | 36507 | 8.6% |
| 충 | 26039 | 6.1% |
| 면 | 25137 | 5.9% |
| 측 | 23320 | 5.5% |
| 기 | 21119 | 5.0% |
| 타 | 21119 | 5.0% |
| 사 | 15988 | 3.8% |
| 람 | 15988 | 3.8% |
| Other values (18) | 53098 |
Space Separator
| Value | Count | Frequency (%) |
| 133880 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 66940 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Hangul | 424749 | |
| Common | 200820 |
Most frequent character per script
Hangul
| Value | Count | Frequency (%) |
| 차 | 119494 | |
| 대 | 66940 | |
| 돌 | 36507 | 8.6% |
| 충 | 26039 | 6.1% |
| 면 | 25137 | 5.9% |
| 측 | 23320 | 5.5% |
| 기 | 21119 | 5.0% |
| 타 | 21119 | 5.0% |
| 사 | 15988 | 3.8% |
| 람 | 15988 | 3.8% |
| Other values (18) | 53098 |
Common
| Value | Count | Frequency (%) |
| 133880 | ||
| - | 66940 |
Most occurring blocks
| Value | Count | Frequency (%) |
| Hangul | 424749 | |
| ASCII | 200820 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 133880 | ||
| - | 66940 |
Hangul
| Value | Count | Frequency (%) |
| 차 | 119494 | |
| 대 | 66940 | |
| 돌 | 36507 | 8.6% |
| 충 | 26039 | 6.1% |
| 면 | 25137 | 5.9% |
| 측 | 23320 | 5.5% |
| 기 | 21119 | 5.0% |
| 타 | 21119 | 5.0% |
| 사 | 15988 | 3.8% |
| 람 | 15988 | 3.8% |
| Other values (18) | 53098 |
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 523.1 KiB |
| 단일로 | |
|---|---|
| 교차로 | |
| 기타 | |
| 주차장 | 214 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.948700329 |
| Min length | 2 |
Characters and Unicode
| Total characters | 197386 |
|---|---|
| Distinct characters | 9 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 교차로 |
|---|---|
| 2nd row | 단일로 |
| 3rd row | 기타 |
| 4th row | 단일로 |
| 5th row | 교차로 |
Common Values
| Value | Count | Frequency (%) |
| 단일로 | 33555 | |
| 교차로 | 29737 | |
| 기타 | 3434 | 5.1% |
| 주차장 | 214 | 0.3% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 단일로 | 33555 | |
| 교차로 | 29737 | |
| 기타 | 3434 | 5.1% |
| 주차장 | 214 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 로 | 63292 | |
| 단 | 33555 | |
| 일 | 33555 | |
| 차 | 29951 | |
| 교 | 29737 | |
| 기 | 3434 | 1.7% |
| 타 | 3434 | 1.7% |
| 주 | 214 | 0.1% |
| 장 | 214 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Other Letter | 197386 |
Most frequent character per category
Other Letter
| Value | Count | Frequency (%) |
| 로 | 63292 | |
| 단 | 33555 | |
| 일 | 33555 | |
| 차 | 29951 | |
| 교 | 29737 | |
| 기 | 3434 | 1.7% |
| 타 | 3434 | 1.7% |
| 주 | 214 | 0.1% |
| 장 | 214 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Hangul | 197386 |
Most frequent character per script
Hangul
| Value | Count | Frequency (%) |
| 로 | 63292 | |
| 단 | 33555 | |
| 일 | 33555 | |
| 차 | 29951 | |
| 교 | 29737 | |
| 기 | 3434 | 1.7% |
| 타 | 3434 | 1.7% |
| 주 | 214 | 0.1% |
| 장 | 214 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| Hangul | 197386 |
Most frequent character per block
Hangul
| Value | Count | Frequency (%) |
| 로 | 63292 | |
| 단 | 33555 | |
| 일 | 33555 | |
| 차 | 29951 | |
| 교 | 29737 | |
| 기 | 3434 | 1.7% |
| 타 | 3434 | 1.7% |
| 주 | 214 | 0.1% |
| 장 | 214 | 0.1% |
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 523.1 KiB |
| 단일로 - 기타 | |
|---|---|
| 교차로 - 교차로안 | |
| 교차로 - 교차로부근 | |
| 기타 - 기타 | |
| 교차로 - 교차로횡단보도내 | 2603 |
| Other values (6) | 1845 |
Length
| Max length | 15 |
|---|---|
| Median length | 14 |
| Mean length | 9.220914252 |
| Min length | 7 |
Characters and Unicode
| Total characters | 617248 |
|---|---|
| Distinct characters | 31 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 교차로 - 교차로횡단보도내 |
|---|---|
| 2nd row | 단일로 - 기타 |
| 3rd row | 기타 - 기타 |
| 4th row | 단일로 - 터널 |
| 5th row | 교차로 - 교차로부근 |
Common Values
| Value | Count | Frequency (%) |
| 단일로 - 기타 | 31941 | |
| 교차로 - 교차로안 | 17473 | |
| 교차로 - 교차로부근 | 9661 | 14.4% |
| 기타 - 기타 | 3417 | 5.1% |
| 교차로 - 교차로횡단보도내 | 2603 | 3.9% |
| 단일로 - 지하차도(도로)내 | 663 | 1.0% |
| 단일로 - 교량 | 517 | 0.8% |
| 단일로 - 고가도로위 | 242 | 0.4% |
| 주차장 - 주차장 | 214 | 0.3% |
| 단일로 - 터널 | 192 | 0.3% |
Length
| Value | Count | Frequency (%) |
| 66940 | ||
| 기타 | 38775 | |
| 단일로 | 33555 | |
| 교차로 | 29737 | |
| 교차로안 | 17473 | 8.7% |
| 교차로부근 | 9661 | 4.8% |
| 교차로횡단보도내 | 2603 | 1.3% |
| 지하차도(도로)내 | 663 | 0.3% |
| 교량 | 517 | 0.3% |
| 주차장 | 428 | 0.2% |
| Other values (3) | 468 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 133880 | ||
| 로 | 93934 | |
| - | 66940 | |
| 차 | 60565 | |
| 교 | 59991 | |
| 기 | 38775 | 6.3% |
| 타 | 38775 | 6.3% |
| 단 | 36158 | 5.9% |
| 일 | 33555 | 5.4% |
| 안 | 17473 | 2.8% |
| Other values (21) | 37202 | 6.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Other Letter | 415102 | |
| Space Separator | 133880 | 21.7% |
| Dash Punctuation | 66940 | 10.8% |
| Open Punctuation | 663 | 0.1% |
| Close Punctuation | 663 | 0.1% |
Most frequent character per category
Other Letter
| Value | Count | Frequency (%) |
| 로 | 93934 | |
| 차 | 60565 | |
| 교 | 59991 | |
| 기 | 38775 | |
| 타 | 38775 | |
| 단 | 36158 | 8.7% |
| 일 | 33555 | 8.1% |
| 안 | 17473 | 4.2% |
| 부 | 9661 | 2.3% |
| 근 | 9661 | 2.3% |
| Other values (17) | 16554 | 4.0% |
Space Separator
| Value | Count | Frequency (%) |
| 133880 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 66940 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 663 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 663 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Hangul | 415102 | |
| Common | 202146 |
Most frequent character per script
Hangul
| Value | Count | Frequency (%) |
| 로 | 93934 | |
| 차 | 60565 | |
| 교 | 59991 | |
| 기 | 38775 | |
| 타 | 38775 | |
| 단 | 36158 | 8.7% |
| 일 | 33555 | 8.1% |
| 안 | 17473 | 4.2% |
| 부 | 9661 | 2.3% |
| 근 | 9661 | 2.3% |
| Other values (17) | 16554 | 4.0% |
Common
| Value | Count | Frequency (%) |
| 133880 | ||
| - | 66940 | |
| ( | 663 | 0.3% |
| ) | 663 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| Hangul | 415102 | |
| ASCII | 202146 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 133880 | ||
| - | 66940 | |
| ( | 663 | 0.3% |
| ) | 663 | 0.3% |
Hangul
| Value | Count | Frequency (%) |
| 로 | 93934 | |
| 차 | 60565 | |
| 교 | 59991 | |
| 기 | 38775 | |
| 타 | 38775 | |
| 단 | 36158 | 8.7% |
| 일 | 33555 | 8.1% |
| 안 | 17473 | 4.2% |
| 부 | 9661 | 2.3% |
| 근 | 9661 | 2.3% |
| Other values (17) | 16554 | 4.0% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 523.1 KiB |
| 포장 | |
|---|---|
| 비포장 | 48 |
Length
| Max length | 3 |
|---|---|
| Median length | 2 |
| Mean length | 2.00071706 |
| Min length | 2 |
Characters and Unicode
| Total characters | 133928 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 포장 |
|---|---|
| 2nd row | 포장 |
| 3rd row | 포장 |
| 4th row | 포장 |
| 5th row | 포장 |
Common Values
| Value | Count | Frequency (%) |
| 포장 | 66892 | |
| 비포장 | 48 | 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 포장 | 66892 | |
| 비포장 | 48 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 포 | 66940 | |
| 장 | 66940 | |
| 비 | 48 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Other Letter | 133928 |
Most frequent character per category
Other Letter
| Value | Count | Frequency (%) |
| 포 | 66940 | |
| 장 | 66940 | |
| 비 | 48 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Hangul | 133928 |
Most frequent character per script
Hangul
| Value | Count | Frequency (%) |
| 포 | 66940 | |
| 장 | 66940 | |
| 비 | 48 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| Hangul | 133928 |
Most frequent character per block
Hangul
| Value | Count | Frequency (%) |
| 포 | 66940 | |
| 장 | 66940 | |
| 비 | 48 | < 0.1% |
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 523.1 KiB |
| 포장 - 건조 | |
|---|---|
| 포장 - 젖음/습기 | 5410 |
| 포장 - 기타 | 809 |
| 포장 - 서리/결빙 | 46 |
| 포장 - 적설 | 35 |
| Other values (5) | 51 |
Length
| Max length | 11 |
|---|---|
| Median length | 7 |
| Mean length | 7.246354945 |
| Min length | 7 |
Characters and Unicode
| Total characters | 485071 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 포장 - 건조 |
|---|---|
| 2nd row | 포장 - 건조 |
| 3rd row | 포장 - 건조 |
| 4th row | 포장 - 건조 |
| 5th row | 포장 - 건조 |
Common Values
| Value | Count | Frequency (%) |
| 포장 - 건조 | 60589 | |
| 포장 - 젖음/습기 | 5410 | 8.1% |
| 포장 - 기타 | 809 | 1.2% |
| 포장 - 서리/결빙 | 46 | 0.1% |
| 포장 - 적설 | 35 | 0.1% |
| 비포장 - 젖음/습기 | 25 | < 0.1% |
| 비포장 - 건조 | 19 | < 0.1% |
| 비포장 - 기타 | 4 | < 0.1% |
| 포장 - 해빙 | 2 | < 0.1% |
| 포장 - 침수 | 1 | < 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 66940 | ||
| 포장 | 66892 | |
| 건조 | 60608 | |
| 젖음/습기 | 5435 | 2.7% |
| 기타 | 813 | 0.4% |
| 비포장 | 48 | < 0.1% |
| 서리/결빙 | 46 | < 0.1% |
| 적설 | 35 | < 0.1% |
| 해빙 | 2 | < 0.1% |
| 침수 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 133880 | ||
| 포 | 66940 | |
| 장 | 66940 | |
| - | 66940 | |
| 건 | 60608 | |
| 조 | 60608 | |
| 기 | 6248 | 1.3% |
| / | 5481 | 1.1% |
| 젖 | 5435 | 1.1% |
| 음 | 5435 | 1.1% |
| Other values (12) | 6556 | 1.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Other Letter | 278770 | |
| Space Separator | 133880 | |
| Dash Punctuation | 66940 | 13.8% |
| Other Punctuation | 5481 | 1.1% |
Most frequent character per category
Other Letter
| Value | Count | Frequency (%) |
| 포 | 66940 | |
| 장 | 66940 | |
| 건 | 60608 | |
| 조 | 60608 | |
| 기 | 6248 | 2.2% |
| 젖 | 5435 | 1.9% |
| 음 | 5435 | 1.9% |
| 습 | 5435 | 1.9% |
| 타 | 813 | 0.3% |
| 빙 | 48 | < 0.1% |
| Other values (9) | 260 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 133880 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 66940 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 5481 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Hangul | 278770 | |
| Common | 206301 |
Most frequent character per script
Hangul
| Value | Count | Frequency (%) |
| 포 | 66940 | |
| 장 | 66940 | |
| 건 | 60608 | |
| 조 | 60608 | |
| 기 | 6248 | 2.2% |
| 젖 | 5435 | 1.9% |
| 음 | 5435 | 1.9% |
| 습 | 5435 | 1.9% |
| 타 | 813 | 0.3% |
| 빙 | 48 | < 0.1% |
| Other values (9) | 260 | 0.1% |
Common
| Value | Count | Frequency (%) |
| 133880 | ||
| - | 66940 | |
| / | 5481 | 2.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| Hangul | 278770 | |
| ASCII | 206301 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 133880 | ||
| - | 66940 | |
| / | 5481 | 2.7% |
Hangul
| Value | Count | Frequency (%) |
| 포 | 66940 | |
| 장 | 66940 | |
| 건 | 60608 | |
| 조 | 60608 | |
| 기 | 6248 | 2.2% |
| 젖 | 5435 | 1.9% |
| 음 | 5435 | 1.9% |
| 습 | 5435 | 1.9% |
| 타 | 813 | 0.3% |
| 빙 | 48 | < 0.1% |
| Other values (9) | 260 | 0.1% |
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 523.1 KiB |
| 안전운전불이행 | |
|---|---|
| 안전거리미확보 | |
| 신호위반 | |
| 보행자보호의무위반 | 2506 |
| 교차로운행방법위반 | 2501 |
| Other values (6) |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 6.495682701 |
| Min length | 2 |
Characters and Unicode
| Total characters | 434821 |
|---|---|
| Distinct characters | 39 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 보행자보호의무위반 |
|---|---|
| 2nd row | 안전운전불이행 |
| 3rd row | 안전운전불이행 |
| 4th row | 안전운전불이행 |
| 5th row | 안전운전불이행 |
Common Values
| Value | Count | Frequency (%) |
| 안전운전불이행 | 36380 | |
| 안전거리미확보 | 9607 | 14.4% |
| 신호위반 | 8491 | 12.7% |
| 보행자보호의무위반 | 2506 | 3.7% |
| 교차로운행방법위반 | 2501 | 3.7% |
| 기타 | 2251 | 3.4% |
| 중앙선침범 | 1896 | 2.8% |
| 직진우회전진행방해 | 1409 | 2.1% |
| 차로위반 | 1223 | 1.8% |
| 불법유턴 | 489 | 0.7% |
Length
| Value | Count | Frequency (%) |
| 안전운전불이행 | 36380 | |
| 안전거리미확보 | 9607 | 14.4% |
| 신호위반 | 8491 | 12.7% |
| 보행자보호의무위반 | 2506 | 3.7% |
| 교차로운행방법위반 | 2501 | 3.7% |
| 기타 | 2251 | 3.4% |
| 중앙선침범 | 1896 | 2.8% |
| 직진우회전진행방해 | 1409 | 2.1% |
| 차로위반 | 1223 | 1.8% |
| 불법유턴 | 489 | 0.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 전 | 83776 | |
| 안 | 45987 | |
| 행 | 42796 | |
| 운 | 38881 | |
| 불 | 36869 | 8.5% |
| 이 | 36380 | 8.4% |
| 반 | 14721 | 3.4% |
| 위 | 14721 | 3.4% |
| 보 | 14619 | 3.4% |
| 호 | 10997 | 2.5% |
| Other values (29) | 95074 |
Most occurring categories
| Value | Count | Frequency (%) |
| Other Letter | 434821 |
Most frequent character per category
Other Letter
| Value | Count | Frequency (%) |
| 전 | 83776 | |
| 안 | 45987 | |
| 행 | 42796 | |
| 운 | 38881 | |
| 불 | 36869 | 8.5% |
| 이 | 36380 | 8.4% |
| 반 | 14721 | 3.4% |
| 위 | 14721 | 3.4% |
| 보 | 14619 | 3.4% |
| 호 | 10997 | 2.5% |
| Other values (29) | 95074 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Hangul | 434821 |
Most frequent character per script
Hangul
| Value | Count | Frequency (%) |
| 전 | 83776 | |
| 안 | 45987 | |
| 행 | 42796 | |
| 운 | 38881 | |
| 불 | 36869 | 8.5% |
| 이 | 36380 | 8.4% |
| 반 | 14721 | 3.4% |
| 위 | 14721 | 3.4% |
| 보 | 14619 | 3.4% |
| 호 | 10997 | 2.5% |
| Other values (29) | 95074 |
Most occurring blocks
| Value | Count | Frequency (%) |
| Hangul | 434821 |
Most frequent character per block
Hangul
| Value | Count | Frequency (%) |
| 전 | 83776 | |
| 안 | 45987 | |
| 행 | 42796 | |
| 운 | 38881 | |
| 불 | 36869 | 8.5% |
| 이 | 36380 | 8.4% |
| 반 | 14721 | 3.4% |
| 위 | 14721 | 3.4% |
| 보 | 14619 | 3.4% |
| 호 | 10997 | 2.5% |
| Other values (29) | 95074 |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 523.1 KiB |
| 맑음 | |
|---|---|
| 비 | 4221 |
| 흐림 | 2619 |
| 눈 | 106 |
| 안개 | 3 |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 1.935360024 |
| Min length | 1 |
Characters and Unicode
| Total characters | 129553 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 맑음 |
|---|---|
| 2nd row | 맑음 |
| 3rd row | 맑음 |
| 4th row | 맑음 |
| 5th row | 맑음 |
Common Values
| Value | Count | Frequency (%) |
| 맑음 | 59991 | |
| 비 | 4221 | 6.3% |
| 흐림 | 2619 | 3.9% |
| 눈 | 106 | 0.2% |
| 안개 | 3 | < 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 맑음 | 59991 | |
| 비 | 4221 | 6.3% |
| 흐림 | 2619 | 3.9% |
| 눈 | 106 | 0.2% |
| 안개 | 3 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 맑 | 59991 | |
| 음 | 59991 | |
| 비 | 4221 | 3.3% |
| 흐 | 2619 | 2.0% |
| 림 | 2619 | 2.0% |
| 눈 | 106 | 0.1% |
| 안 | 3 | < 0.1% |
| 개 | 3 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Other Letter | 129553 |
Most frequent character per category
Other Letter
| Value | Count | Frequency (%) |
| 맑 | 59991 | |
| 음 | 59991 | |
| 비 | 4221 | 3.3% |
| 흐 | 2619 | 2.0% |
| 림 | 2619 | 2.0% |
| 눈 | 106 | 0.1% |
| 안 | 3 | < 0.1% |
| 개 | 3 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Hangul | 129553 |
Most frequent character per script
Hangul
| Value | Count | Frequency (%) |
| 맑 | 59991 | |
| 음 | 59991 | |
| 비 | 4221 | 3.3% |
| 흐 | 2619 | 2.0% |
| 림 | 2619 | 2.0% |
| 눈 | 106 | 0.1% |
| 안 | 3 | < 0.1% |
| 개 | 3 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| Hangul | 129553 |
Most frequent character per block
Hangul
| Value | Count | Frequency (%) |
| 맑 | 59991 | |
| 음 | 59991 | |
| 비 | 4221 | 3.3% |
| 흐 | 2619 | 2.0% |
| 림 | 2619 | 2.0% |
| 눈 | 106 | 0.1% |
| 안 | 3 | < 0.1% |
| 개 | 3 | < 0.1% |
가해운전자차종
Categorical
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 523.1 KiB |
| 승용 | |
|---|---|
| 이륜 | |
| 화물 | |
| 승합 | 3838 |
| 자전거 | 3286 |
| Other values (6) | 2284 |
Length
| Max length | 11 |
|---|---|
| Median length | 2 |
| Mean length | 2.145787272 |
| Min length | 2 |
Characters and Unicode
| Total characters | 143639 |
|---|---|
| Distinct characters | 34 |
| Distinct categories | 4 ? |
| Distinct scripts | 3 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 승용 |
|---|---|
| 2nd row | 이륜 |
| 3rd row | 승용 |
| 4th row | 승용 |
| 5th row | 승용 |
Common Values
| Value | Count | Frequency (%) |
| 승용 | 44382 | |
| 이륜 | 7309 | 10.9% |
| 화물 | 5841 | 8.7% |
| 승합 | 3838 | 5.7% |
| 자전거 | 3286 | 4.9% |
| 원동기 | 1107 | 1.7% |
| 건설기계 | 554 | 0.8% |
| 개인형이동수단(PM) | 456 | 0.7% |
| 특수 | 149 | 0.2% |
| 사륜오토바이(ATV) | 17 | < 0.1% |
Length
| Value | Count | Frequency (%) |
| 승용 | 44382 | |
| 이륜 | 7309 | 10.9% |
| 화물 | 5841 | 8.7% |
| 승합 | 3838 | 5.7% |
| 자전거 | 3286 | 4.9% |
| 원동기 | 1107 | 1.7% |
| 건설기계 | 554 | 0.8% |
| 개인형이동수단(pm | 456 | 0.7% |
| 특수 | 149 | 0.2% |
| 사륜오토바이(atv | 17 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 승 | 48220 | |
| 용 | 44382 | |
| 이 | 7782 | 5.4% |
| 륜 | 7326 | 5.1% |
| 화 | 5841 | 4.1% |
| 물 | 5841 | 4.1% |
| 합 | 3838 | 2.7% |
| 자 | 3286 | 2.3% |
| 전 | 3286 | 2.3% |
| 거 | 3286 | 2.3% |
| Other values (24) | 10551 | 7.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Other Letter | 141730 | |
| Uppercase Letter | 963 | 0.7% |
| Close Punctuation | 473 | 0.3% |
| Open Punctuation | 473 | 0.3% |
Most frequent character per category
Other Letter
| Value | Count | Frequency (%) |
| 승 | 48220 | |
| 용 | 44382 | |
| 이 | 7782 | 5.5% |
| 륜 | 7326 | 5.2% |
| 화 | 5841 | 4.1% |
| 물 | 5841 | 4.1% |
| 합 | 3838 | 2.7% |
| 자 | 3286 | 2.3% |
| 전 | 3286 | 2.3% |
| 거 | 3286 | 2.3% |
| Other values (17) | 8642 | 6.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 456 | |
| P | 456 | |
| A | 17 | 1.8% |
| T | 17 | 1.8% |
| V | 17 | 1.8% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 473 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 473 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Hangul | 141730 | |
| Latin | 963 | 0.7% |
| Common | 946 | 0.7% |
Most frequent character per script
Hangul
| Value | Count | Frequency (%) |
| 승 | 48220 | |
| 용 | 44382 | |
| 이 | 7782 | 5.5% |
| 륜 | 7326 | 5.2% |
| 화 | 5841 | 4.1% |
| 물 | 5841 | 4.1% |
| 합 | 3838 | 2.7% |
| 자 | 3286 | 2.3% |
| 전 | 3286 | 2.3% |
| 거 | 3286 | 2.3% |
| Other values (17) | 8642 | 6.1% |
Latin
| Value | Count | Frequency (%) |
| M | 456 | |
| P | 456 | |
| A | 17 | 1.8% |
| T | 17 | 1.8% |
| V | 17 | 1.8% |
Common
| Value | Count | Frequency (%) |
| ) | 473 | |
| ( | 473 |
Most occurring blocks
| Value | Count | Frequency (%) |
| Hangul | 141730 | |
| ASCII | 1909 | 1.3% |
Most frequent character per block
Hangul
| Value | Count | Frequency (%) |
| 승 | 48220 | |
| 용 | 44382 | |
| 이 | 7782 | 5.5% |
| 륜 | 7326 | 5.2% |
| 화 | 5841 | 4.1% |
| 물 | 5841 | 4.1% |
| 합 | 3838 | 2.7% |
| 자 | 3286 | 2.3% |
| 전 | 3286 | 2.3% |
| 거 | 3286 | 2.3% |
| Other values (17) | 8642 | 6.1% |
ASCII
| Value | Count | Frequency (%) |
| ) | 473 | |
| ( | 473 | |
| M | 456 | |
| P | 456 | |
| A | 17 | 0.9% |
| T | 17 | 0.9% |
| V | 17 | 0.9% |
가해운전자남성여부
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 65.5 KiB |
| True | |
|---|---|
| False |
| Value | Count | Frequency (%) |
| True | 54542 | |
| False | 12398 | 18.5% |
가해운전자연령
Real number (ℝ≥0)
| Distinct | 61 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 48.15582611 |
| Minimum | 20 |
|---|---|
| Maximum | 80 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 523.1 KiB |
Quantile statistics
| Minimum | 20 |
|---|---|
| 5-th percentile | 22 |
| Q1 | 35 |
| median | 50 |
| Q3 | 60 |
| 95-th percentile | 72 |
| Maximum | 80 |
| Range | 60 |
| Interquartile range (IQR) | 25 |
Descriptive statistics
| Standard deviation | 15.46394783 |
|---|---|
| Coefficient of variation (CV) | 0.3211230931 |
| Kurtosis | -0.9840362461 |
| Mean | 48.15582611 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | -0.1076478634 |
| Sum | 3223551 |
| Variance | 239.1336826 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 20 | 2388 | 3.6% |
| 59 | 1787 | 2.7% |
| 60 | 1716 | 2.6% |
| 58 | 1710 | 2.6% |
| 57 | 1682 | 2.5% |
| 61 | 1633 | 2.4% |
| 56 | 1623 | 2.4% |
| 62 | 1598 | 2.4% |
| 55 | 1468 | 2.2% |
| 63 | 1455 | 2.2% |
| Other values (51) | 49880 |
| Value | Count | Frequency (%) |
| 20 | 2388 | |
| 21 | 455 | 0.7% |
| 22 | 607 | 0.9% |
| 23 | 735 | 1.1% |
| 24 | 822 | 1.2% |
| 25 | 937 | 1.4% |
| 26 | 1058 | |
| 27 | 1141 | |
| 28 | 1184 | |
| 29 | 1182 |
| Value | Count | Frequency (%) |
| 80 | 510 | |
| 79 | 172 | 0.3% |
| 78 | 256 | 0.4% |
| 77 | 315 | |
| 76 | 337 | |
| 75 | 367 | |
| 74 | 428 | |
| 73 | 511 | |
| 72 | 669 | |
| 71 | 777 |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 523.1 KiB |
| 상해없음 | |
|---|---|
| 경상 | |
| 부상신고 | 3141 |
| 중상 | 1515 |
| 사망 | 80 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 3.760382432 |
| Min length | 2 |
Characters and Unicode
| Total characters | 251720 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 상해없음 |
|---|---|
| 2nd row | 상해없음 |
| 3rd row | 상해없음 |
| 4th row | 상해없음 |
| 5th row | 상해없음 |
Common Values
| Value | Count | Frequency (%) |
| 상해없음 | 55779 | |
| 경상 | 6425 | 9.6% |
| 부상신고 | 3141 | 4.7% |
| 중상 | 1515 | 2.3% |
| 사망 | 80 | 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 상해없음 | 55779 | |
| 경상 | 6425 | 9.6% |
| 부상신고 | 3141 | 4.7% |
| 중상 | 1515 | 2.3% |
| 사망 | 80 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 상 | 66860 | |
| 해 | 55779 | |
| 없 | 55779 | |
| 음 | 55779 | |
| 경 | 6425 | 2.6% |
| 부 | 3141 | 1.2% |
| 신 | 3141 | 1.2% |
| 고 | 3141 | 1.2% |
| 중 | 1515 | 0.6% |
| 사 | 80 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Other Letter | 251720 |
Most frequent character per category
Other Letter
| Value | Count | Frequency (%) |
| 상 | 66860 | |
| 해 | 55779 | |
| 없 | 55779 | |
| 음 | 55779 | |
| 경 | 6425 | 2.6% |
| 부 | 3141 | 1.2% |
| 신 | 3141 | 1.2% |
| 고 | 3141 | 1.2% |
| 중 | 1515 | 0.6% |
| 사 | 80 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Hangul | 251720 |
Most frequent character per script
Hangul
| Value | Count | Frequency (%) |
| 상 | 66860 | |
| 해 | 55779 | |
| 없 | 55779 | |
| 음 | 55779 | |
| 경 | 6425 | 2.6% |
| 부 | 3141 | 1.2% |
| 신 | 3141 | 1.2% |
| 고 | 3141 | 1.2% |
| 중 | 1515 | 0.6% |
| 사 | 80 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| Hangul | 251720 |
Most frequent character per block
Hangul
| Value | Count | Frequency (%) |
| 상 | 66860 | |
| 해 | 55779 | |
| 없 | 55779 | |
| 음 | 55779 | |
| 경 | 6425 | 2.6% |
| 부 | 3141 | 1.2% |
| 신 | 3141 | 1.2% |
| 고 | 3141 | 1.2% |
| 중 | 1515 | 0.6% |
| 사 | 80 | < 0.1% |
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 523.1 KiB |
| 승용 | |
|---|---|
| 보행자 | |
| 이륜 | |
| 자전거 | 2962 |
| 승합 | 2809 |
| Other values (4) |
Length
| Max length | 4 |
|---|---|
| Median length | 2 |
| Mean length | 2.317403645 |
| Min length | 2 |
Characters and Unicode
| Total characters | 155127 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 보행자 |
|---|---|
| 2nd row | 승용 |
| 3rd row | 화물 |
| 4th row | 승용 |
| 5th row | 보행자 |
Common Values
| Value | Count | Frequency (%) |
| 승용 | 31454 | |
| 보행자 | 15988 | |
| 이륜 | 9089 | 13.6% |
| 자전거 | 2962 | 4.4% |
| 승합 | 2809 | 4.2% |
| 화물 | 2795 | 4.2% |
| 원동기 | 1187 | 1.8% |
| 기타불명 | 555 | 0.8% |
| 특수 | 101 | 0.2% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 승용 | 31454 | |
| 보행자 | 15988 | |
| 이륜 | 9089 | 13.6% |
| 자전거 | 2962 | 4.4% |
| 승합 | 2809 | 4.2% |
| 화물 | 2795 | 4.2% |
| 원동기 | 1187 | 1.8% |
| 기타불명 | 555 | 0.8% |
| 특수 | 101 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 승 | 34263 | |
| 용 | 31454 | |
| 자 | 18950 | |
| 보 | 15988 | |
| 행 | 15988 | |
| 이 | 9089 | 5.9% |
| 륜 | 9089 | 5.9% |
| 전 | 2962 | 1.9% |
| 거 | 2962 | 1.9% |
| 합 | 2809 | 1.8% |
| Other values (10) | 11573 | 7.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Other Letter | 155127 |
Most frequent character per category
Other Letter
| Value | Count | Frequency (%) |
| 승 | 34263 | |
| 용 | 31454 | |
| 자 | 18950 | |
| 보 | 15988 | |
| 행 | 15988 | |
| 이 | 9089 | 5.9% |
| 륜 | 9089 | 5.9% |
| 전 | 2962 | 1.9% |
| 거 | 2962 | 1.9% |
| 합 | 2809 | 1.8% |
| Other values (10) | 11573 | 7.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Hangul | 155127 |
Most frequent character per script
Hangul
| Value | Count | Frequency (%) |
| 승 | 34263 | |
| 용 | 31454 | |
| 자 | 18950 | |
| 보 | 15988 | |
| 행 | 15988 | |
| 이 | 9089 | 5.9% |
| 륜 | 9089 | 5.9% |
| 전 | 2962 | 1.9% |
| 거 | 2962 | 1.9% |
| 합 | 2809 | 1.8% |
| Other values (10) | 11573 | 7.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| Hangul | 155127 |
Most frequent character per block
Hangul
| Value | Count | Frequency (%) |
| 승 | 34263 | |
| 용 | 31454 | |
| 자 | 18950 | |
| 보 | 15988 | |
| 행 | 15988 | |
| 이 | 9089 | 5.9% |
| 륜 | 9089 | 5.9% |
| 전 | 2962 | 1.9% |
| 거 | 2962 | 1.9% |
| 합 | 2809 | 1.8% |
| Other values (10) | 11573 | 7.5% |
피해운전자남성여부
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 65.5 KiB |
| True | |
|---|---|
| False |
| Value | Count | Frequency (%) |
| True | 49621 | |
| False | 17319 | 25.9% |
피해운전자연령
Real number (ℝ≥0)
| Distinct | 80 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 45.48948312 |
| Minimum | 1 |
|---|---|
| Maximum | 80 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 523.1 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 21 |
| Q1 | 32 |
| median | 46 |
| Q3 | 58 |
| 95-th percentile | 72 |
| Maximum | 80 |
| Range | 79 |
| Interquartile range (IQR) | 26 |
Descriptive statistics
| Standard deviation | 16.26866434 |
|---|---|
| Coefficient of variation (CV) | 0.3576357263 |
| Kurtosis | -0.742769683 |
| Mean | 45.48948312 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | 0.0249230641 |
| Sum | 3045066 |
| Variance | 264.6694393 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 59 | 1466 | 2.2% |
| 27 | 1456 | 2.2% |
| 37 | 1440 | 2.2% |
| 58 | 1431 | 2.1% |
| 28 | 1409 | 2.1% |
| 29 | 1405 | 2.1% |
| 50 | 1404 | 2.1% |
| 49 | 1400 | 2.1% |
| 60 | 1390 | 2.1% |
| 51 | 1386 | 2.1% |
| Other values (70) | 52753 |
| Value | Count | Frequency (%) |
| 1 | 5 | < 0.1% |
| 2 | 14 | < 0.1% |
| 3 | 29 | < 0.1% |
| 4 | 50 | 0.1% |
| 5 | 49 | 0.1% |
| 6 | 84 | |
| 7 | 99 | |
| 8 | 142 | |
| 9 | 136 | |
| 10 | 129 |
| Value | Count | Frequency (%) |
| 80 | 981 | |
| 79 | 211 | 0.3% |
| 78 | 243 | 0.4% |
| 77 | 316 | 0.5% |
| 76 | 337 | 0.5% |
| 75 | 357 | 0.5% |
| 74 | 304 | 0.5% |
| 73 | 391 | 0.6% |
| 72 | 560 | |
| 71 | 549 |
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 523.1 KiB |
| 경상 | |
|---|---|
| 중상 | |
| 상해없음 | |
| 부상신고 | 2478 |
| 사망 | 301 |
Length
| Max length | 4 |
|---|---|
| Median length | 2 |
| Mean length | 2.354884972 |
| Min length | 2 |
Characters and Unicode
| Total characters | 157636 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 경상 |
|---|---|
| 2nd row | 경상 |
| 3rd row | 경상 |
| 4th row | 경상 |
| 5th row | 경상 |
Common Values
| Value | Count | Frequency (%) |
| 경상 | 41364 | |
| 중상 | 13397 | 20.0% |
| 상해없음 | 9100 | 13.6% |
| 부상신고 | 2478 | 3.7% |
| 사망 | 301 | 0.4% |
| 기타불명 | 300 | 0.4% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 경상 | 41364 | |
| 중상 | 13397 | 20.0% |
| 상해없음 | 9100 | 13.6% |
| 부상신고 | 2478 | 3.7% |
| 사망 | 301 | 0.4% |
| 기타불명 | 300 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 상 | 66339 | |
| 경 | 41364 | |
| 중 | 13397 | 8.5% |
| 해 | 9100 | 5.8% |
| 없 | 9100 | 5.8% |
| 음 | 9100 | 5.8% |
| 부 | 2478 | 1.6% |
| 신 | 2478 | 1.6% |
| 고 | 2478 | 1.6% |
| 사 | 301 | 0.2% |
| Other values (5) | 1501 | 1.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Other Letter | 157636 |
Most frequent character per category
Other Letter
| Value | Count | Frequency (%) |
| 상 | 66339 | |
| 경 | 41364 | |
| 중 | 13397 | 8.5% |
| 해 | 9100 | 5.8% |
| 없 | 9100 | 5.8% |
| 음 | 9100 | 5.8% |
| 부 | 2478 | 1.6% |
| 신 | 2478 | 1.6% |
| 고 | 2478 | 1.6% |
| 사 | 301 | 0.2% |
| Other values (5) | 1501 | 1.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Hangul | 157636 |
Most frequent character per script
Hangul
| Value | Count | Frequency (%) |
| 상 | 66339 | |
| 경 | 41364 | |
| 중 | 13397 | 8.5% |
| 해 | 9100 | 5.8% |
| 없 | 9100 | 5.8% |
| 음 | 9100 | 5.8% |
| 부 | 2478 | 1.6% |
| 신 | 2478 | 1.6% |
| 고 | 2478 | 1.6% |
| 사 | 301 | 0.2% |
| Other values (5) | 1501 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| Hangul | 157636 |
Most frequent character per block
Hangul
| Value | Count | Frequency (%) |
| 상 | 66339 | |
| 경 | 41364 | |
| 중 | 13397 | 8.5% |
| 해 | 9100 | 5.8% |
| 없 | 9100 | 5.8% |
| 음 | 9100 | 5.8% |
| 부 | 2478 | 1.6% |
| 신 | 2478 | 1.6% |
| 고 | 2478 | 1.6% |
| 사 | 301 | 0.2% |
| Other values (5) | 1501 | 1.0% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 65.5 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 50285 | |
| True | 16655 | 24.9% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 523.1 KiB |
| 0.0 | |
|---|---|
| 1.0 | 10 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 200820 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 66930 | |
| 1.0 | 10 | < 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0.0 | 66930 | |
| 1.0 | 10 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 133870 | |
| . | 66940 | |
| 1 | 10 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 133880 | |
| Other Punctuation | 66940 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 133870 | |
| 1 | 10 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 66940 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 200820 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 133870 | |
| . | 66940 | |
| 1 | 10 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 200820 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 133870 | |
| . | 66940 | |
| 1 | 10 | < 0.1% |
고속국도사고여부
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 523.1 KiB |
| 0.0 | |
|---|---|
| 1.0 | 241 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 200820 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 66699 | |
| 1.0 | 241 | 0.4% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0.0 | 66699 | |
| 1.0 | 241 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 133639 | |
| . | 66940 | |
| 1 | 241 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 133880 | |
| Other Punctuation | 66940 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 133639 | |
| 1 | 241 | 0.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 66940 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 200820 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 133639 | |
| . | 66940 | |
| 1 | 241 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 200820 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 133639 | |
| . | 66940 | |
| 1 | 241 | 0.1% |
음주사고여부
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 523.1 KiB |
| 0.0 | |
|---|---|
| 1.0 | 3967 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 200820 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 62973 | |
| 1.0 | 3967 | 5.9% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0.0 | 62973 | |
| 1.0 | 3967 | 5.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 129913 | |
| . | 66940 | |
| 1 | 3967 | 2.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 133880 | |
| Other Punctuation | 66940 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 129913 | |
| 1 | 3967 | 3.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 66940 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 200820 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 129913 | |
| . | 66940 | |
| 1 | 3967 | 2.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 200820 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 129913 | |
| . | 66940 | |
| 1 | 3967 | 2.0% |
무면허사고여부
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 523.1 KiB |
| 0.0 | |
|---|---|
| 1.0 | 1009 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 200820 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 65931 | |
| 1.0 | 1009 | 1.5% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0.0 | 65931 | |
| 1.0 | 1009 | 1.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 132871 | |
| . | 66940 | |
| 1 | 1009 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 133880 | |
| Other Punctuation | 66940 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 132871 | |
| 1 | 1009 | 0.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 66940 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 200820 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 132871 | |
| . | 66940 | |
| 1 | 1009 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 200820 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 132871 | |
| . | 66940 | |
| 1 | 1009 | 0.5% |
뺑소니사고여부
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 523.1 KiB |
| 0.0 | |
|---|---|
| 1.0 | 1293 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 200820 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 65647 | |
| 1.0 | 1293 | 1.9% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0.0 | 65647 | |
| 1.0 | 1293 | 1.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 132587 | |
| . | 66940 | |
| 1 | 1293 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 133880 | |
| Other Punctuation | 66940 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 132587 | |
| 1 | 1293 | 1.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 66940 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 200820 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 132587 | |
| . | 66940 | |
| 1 | 1293 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 200820 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 132587 | |
| . | 66940 | |
| 1 | 1293 | 0.6% |
Auto
The auto setting is an easily interpretable pairwise column metric of the following mapping: vartype-vartype : method, categorical-categorical : Cramer's V, numerical-categorical : Cramer's V (using a discretized numerical column), numerical-numerical : Spearman's ρ. This configuration uses the best suitable for each pair of columns.Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| 사고번호 | 사고년도 | 사고월 | 사고일 | 사고시각 | 사고요일 | 시군구_대범주 | 시군구_소범주 | 사고내용 | 사망자수 | 중상자수 | 경상자수 | 부상신고자수 | 사고유형_대범주 | 사고유형_소범주 | 도로형태_대범주 | 도로형태_소범주 | 노면상태_대범주 | 노면상태_소범주 | 법규위반 | 기상상태 | 가해운전자차종 | 가해운전자남성여부 | 가해운전자연령 | 가해운전자상해정도 | 피해운전자차종 | 피해운전자남성여부 | 피해운전자연령 | 피해운전자상해정도 | 주말여부 | 대형사고여부 | 고속국도사고여부 | 음주사고여부 | 무면허사고여부 | 뺑소니사고여부 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | A2019010100100001 | 2019 | 1 | 1 | 0.000000 | 1 | 강서구 | 강서구 방화동 | 경상사고 | 0 | 0 | 1 | 0 | 차대사람 | 차대사람 - 횡단중 | 교차로 | 교차로 - 교차로횡단보도내 | 포장 | 포장 - 건조 | 보행자보호의무위반 | 맑음 | 승용 | True | 26 | 상해없음 | 보행자 | True | 40 | 경상 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 1 | A2019010100100002 | 2019 | 1 | 1 | 0.000000 | 1 | 구로구 | 구로구 고척동 | 경상사고 | 0 | 0 | 1 | 0 | 차대차 | 차대차 - 추돌 | 단일로 | 단일로 - 기타 | 포장 | 포장 - 건조 | 안전운전불이행 | 맑음 | 이륜 | True | 23 | 상해없음 | 승용 | True | 71 | 경상 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 2 | A2019010100100003 | 2019 | 1 | 1 | 0.000000 | 1 | 서초구 | 서초구 서초동 | 경상사고 | 0 | 0 | 1 | 0 | 차대차 | 차대차 - 기타 | 기타 | 기타 - 기타 | 포장 | 포장 - 건조 | 안전운전불이행 | 맑음 | 승용 | True | 33 | 상해없음 | 화물 | True | 51 | 경상 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 3 | A2019010100100019 | 2019 | 1 | 1 | 0.041667 | 1 | 중구 | 중구 회현동2가 | 경상사고 | 0 | 0 | 1 | 0 | 차대차 | 차대차 - 측면충돌 | 단일로 | 단일로 - 터널 | 포장 | 포장 - 건조 | 안전운전불이행 | 맑음 | 승용 | True | 58 | 상해없음 | 승용 | True | 62 | 경상 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 4 | A2019010100100020 | 2019 | 1 | 1 | 0.041667 | 1 | 성동구 | 성동구 행당동 | 경상사고 | 0 | 0 | 1 | 0 | 차대사람 | 차대사람 - 횡단중 | 교차로 | 교차로 - 교차로부근 | 포장 | 포장 - 건조 | 안전운전불이행 | 맑음 | 승용 | True | 30 | 상해없음 | 보행자 | True | 32 | 경상 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 5 | A2019010100100021 | 2019 | 1 | 1 | 0.041667 | 1 | 송파구 | 송파구 잠실동 | 경상사고 | 0 | 0 | 4 | 0 | 차대차 | 차대차 - 추돌 | 교차로 | 교차로 - 교차로부근 | 포장 | 포장 - 건조 | 안전운전불이행 | 맑음 | 승용 | True | 31 | 상해없음 | 승용 | True | 37 | 경상 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 6 | A2019010100100022 | 2019 | 1 | 1 | 0.041667 | 1 | 노원구 | 노원구 공릉동 | 경상사고 | 0 | 0 | 3 | 0 | 차대차 | 차대차 - 추돌 | 단일로 | 단일로 - 기타 | 포장 | 포장 - 건조 | 안전운전불이행 | 맑음 | 승용 | True | 49 | 상해없음 | 승용 | True | 27 | 경상 | False | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 |
| 7 | A2019010100100023 | 2019 | 1 | 1 | 0.041667 | 1 | 노원구 | 노원구 상계동 | 경상사고 | 0 | 0 | 5 | 0 | 차대차 | 차대차 - 추돌 | 단일로 | 단일로 - 기타 | 포장 | 포장 - 건조 | 안전운전불이행 | 맑음 | 승용 | True | 29 | 상해없음 | 승용 | True | 47 | 경상 | False | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 |
| 8 | A2019010100100041 | 2019 | 1 | 1 | 0.083333 | 1 | 강남구 | 강남구 삼성동 | 경상사고 | 0 | 0 | 2 | 0 | 차대차 | 차대차 - 기타 | 교차로 | 교차로 - 교차로안 | 포장 | 포장 - 건조 | 안전운전불이행 | 맑음 | 승용 | False | 28 | 상해없음 | 승용 | True | 59 | 경상 | False | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 |
| 9 | A2019010100100042 | 2019 | 1 | 1 | 0.083333 | 1 | 강남구 | 강남구 논현동 | 경상사고 | 0 | 0 | 1 | 0 | 차대사람 | 차대사람 - 기타 | 단일로 | 단일로 - 기타 | 포장 | 포장 - 건조 | 안전운전불이행 | 맑음 | 승용 | False | 30 | 상해없음 | 보행자 | True | 47 | 경상 | False | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 |
Last rows
| 사고번호 | 사고년도 | 사고월 | 사고일 | 사고시각 | 사고요일 | 시군구_대범주 | 시군구_소범주 | 사고내용 | 사망자수 | 중상자수 | 경상자수 | 부상신고자수 | 사고유형_대범주 | 사고유형_소범주 | 도로형태_대범주 | 도로형태_소범주 | 노면상태_대범주 | 노면상태_소범주 | 법규위반 | 기상상태 | 가해운전자차종 | 가해운전자남성여부 | 가해운전자연령 | 가해운전자상해정도 | 피해운전자차종 | 피해운전자남성여부 | 피해운전자연령 | 피해운전자상해정도 | 주말여부 | 대형사고여부 | 고속국도사고여부 | 음주사고여부 | 무면허사고여부 | 뺑소니사고여부 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 66930 | A2020123100100500 | 2020 | 12 | 31 | 0.833333 | 3 | 서초구 | 서초구 반포동 | 경상사고 | 0 | 0 | 1 | 0 | 차대차 | 차대차 - 후진중충돌 | 단일로 | 단일로 - 지하차도(도로)내 | 포장 | 포장 - 건조 | 안전운전불이행 | 맑음 | 승용 | True | 39 | 상해없음 | 승용 | False | 32 | 경상 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 66931 | A2020123100100526 | 2020 | 12 | 31 | 0.875000 | 3 | 금천구 | 금천구 시흥동 | 경상사고 | 0 | 0 | 1 | 0 | 차대차 | 차대차 - 측면충돌 | 교차로 | 교차로 - 교차로안 | 포장 | 포장 - 건조 | 기타 | 맑음 | 승용 | True | 27 | 상해없음 | 승용 | True | 50 | 경상 | False | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 |
| 66932 | A2020123100100528 | 2020 | 12 | 31 | 0.875000 | 3 | 강서구 | 강서구 화곡동 | 중상사고 | 0 | 1 | 0 | 0 | 차대차 | 차대차 - 기타 | 교차로 | 교차로 - 교차로부근 | 포장 | 포장 - 건조 | 신호위반 | 맑음 | 원동기 | True | 20 | 중상 | 승용 | True | 52 | 상해없음 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 66933 | A2020123100100529 | 2020 | 12 | 31 | 0.875000 | 3 | 강동구 | 강동구 천호동 | 경상사고 | 0 | 0 | 1 | 0 | 차대차 | 차대차 - 측면충돌 | 단일로 | 단일로 - 기타 | 포장 | 포장 - 건조 | 안전거리미확보 | 맑음 | 승용 | True | 70 | 상해없음 | 승용 | True | 25 | 경상 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 66934 | A2020123100100530 | 2020 | 12 | 31 | 0.875000 | 3 | 송파구 | 송파구 잠실동 | 중상사고 | 0 | 2 | 0 | 0 | 차대차 | 차대차 - 추돌 | 단일로 | 단일로 - 교량 | 포장 | 포장 - 건조 | 안전거리미확보 | 맑음 | 승용 | True | 46 | 상해없음 | 승용 | False | 47 | 중상 | False | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 |
| 66935 | A2020123100100570 | 2020 | 12 | 31 | 0.916667 | 3 | 성북구 | 성북구 보문동1가 | 경상사고 | 0 | 0 | 1 | 0 | 차대차 | 차대차 - 측면충돌 | 교차로 | 교차로 - 교차로부근 | 포장 | 포장 - 건조 | 안전운전불이행 | 맑음 | 이륜 | True | 24 | 상해없음 | 승용 | True | 54 | 경상 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 66936 | A2020123100100571 | 2020 | 12 | 31 | 0.916667 | 3 | 동대문구 | 동대문구 제기동 | 경상사고 | 0 | 0 | 1 | 0 | 차대차 | 차대차 - 기타 | 단일로 | 단일로 - 기타 | 포장 | 포장 - 건조 | 안전운전불이행 | 맑음 | 화물 | True | 35 | 상해없음 | 자전거 | True | 41 | 경상 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 66937 | A2020123100100572 | 2020 | 12 | 31 | 0.916667 | 3 | 강동구 | 강동구 강일동 | 경상사고 | 0 | 0 | 1 | 0 | 차대차 | 차대차 - 정면충돌 | 교차로 | 교차로 - 교차로부근 | 포장 | 포장 - 건조 | 중앙선침범 | 흐림 | 승용 | True | 61 | 상해없음 | 승용 | True | 21 | 경상 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 66938 | A2020123100100592 | 2020 | 12 | 31 | 0.958333 | 3 | 송파구 | 송파구 신천동 | 중상사고 | 0 | 1 | 0 | 0 | 차대차 | 차대차 - 측면충돌 | 단일로 | 단일로 - 기타 | 포장 | 포장 - 건조 | 불법유턴 | 맑음 | 승용 | True | 62 | 상해없음 | 이륜 | True | 22 | 중상 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 66939 | A2020123100100593 | 2020 | 12 | 31 | 0.958333 | 3 | 양천구 | 양천구 신월동 | 경상사고 | 0 | 0 | 4 | 0 | 차대차 | 차대차 - 측면충돌 | 단일로 | 단일로 - 기타 | 포장 | 포장 - 건조 | 안전운전불이행 | 맑음 | 승용 | True | 61 | 상해없음 | 승용 | True | 22 | 경상 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |